NTT System Description for the WMT2006 Shared Task

نویسندگان

  • Taro Watanabe
  • Hajime Tsukada
  • Hideki Isozaki
چکیده

We present two translation systems experimented for the shared-task of “Workshop on Statistical Machine Translation,” a phrase-based model and a hierarchical phrase-based model. The former uses a phrasal unit for translation, whereas the latter is conceptualized as a synchronousCFG in which phrases are hierarchically combined using non-terminals. Experiments showed that the hierarchical phrasebased model performed very comparable to the phrase-based model. We also report a phrase/rule extraction technique differentiating tokenization of corpora.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

NAIST-NTT System Description for Patent Translation Task at NTCIR-7

This paper proposes a semi-supervised approach to acquire domain specific translation knowledge from the collection of Wikipedia. The proposed method starts from a small number of seed translation pairs for each domain in a given corpus, and applies the regularized Laplacian to learn translation pairs relevant to the domain. This paper presents evaluation results using the NTCIR-7 Patent Transl...

متن کامل

The Effect of Different Task Types on Learning Prepositions in Form–Focused and Meaning–Focused Interaction Enhancement-Based Classes

The current study examines the impact of different task types on learning prepositions in form and meaning- focused interaction enhancement- based classes. The participants were 57 second Year University students enrolled in three intact lab classes at Tabriz Islamic Azad University.  The first group was provided with form-focused interaction enhancement, the second with the meaning-focused int...

متن کامل

NTT-NII Statistical Machine Translation for NTCIR-10 PatentMT

This paper describes details of the NTT-NII system in NTCIR10 PatentMT task. The system is an extension of the NTTUT system in NTCIR-9 by: a new English dependency parser (for EJ task), a syntactic rule-based pre-ordering (for JE task), a syntax-based post-ordering (for JE task). Our system ranked 1st in EJ subtask both in automatic and subjective evaluation, and was the best SMT system in JE s...

متن کامل

NTT Communication Science Laboratories and National Institute of Informatics at TRECVID 2012 Instance Search and Multimedia Event Detection Tasks

This paper reports our methods and experimental results for two TRECVID 2012 tasks of instance search and multimedia event detection. For the instance search, we applied the BM25(Best Match 25) method, the state-of-the-art ranking function in the field of text retrieval, to the video retrieval task. Our BM25 features the following three factors: (i) the keypoint importance degree (ii) keypoint ...

متن کامل

NTT-UT Statistical Machine Translation in NTCIR-9 PatentMT

This paper describes details of the NTT-UT system in NTCIR9 PatentMT task. One of its key technology is system combination; the final translation hypotheses are chosen from n-bests by different SMT systems in a Minimum Bayes Risk (MBR) manner. Each SMT system includes different technology: syntactic pre-ordering, forest-to-string translation, and using external resources for domain adaptation a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006